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PREDICTING RESPONSE AND OUTCOME 
OF METASTATIC BREAST CANCER ANTI-ESTROGEN THERAPY 

BACKGROUND 

Resistance to anti-estrogens is one of the major challenges in the treatment of 
breast cancer. For more than 25 years, the golden standard for the endocrine treatment of 
all stages of estrogen receptor-positive breast cancer has been tamoxifen (Jordan, 2003, 
Nat Rev. DrugDiscov., 2:205-13; Osborne, 1998, N. Engl. J. Med. 339:1609-18). 
However, in the advanced setting when metastasis is detected approximately half of the 
patients with estrogen receptor-a (ER-a) -positive breast tumors will not respond to 
endocrine treatment, whereas response rates in patients with ER-oc-negative primary 
tumors are very low. Therefore additional biomarkers are needed to identify patients who 
will not respond and to select patients for various tailored treatments. 

In the past 20 years a large number of cell biological factors, other than steroid 
receptors, has been reported that identify those patients who will benefit from endocrine 
therapy or fail to respond (for review see Klijn et al.:, 2002, Ingle WRMaJN (ed): 
Endocrine Tlierapy in Breast Cancer. New York, Marcel Dekker). Few of these, 
however, appeared valuable and useful in daily clinical practice. In these individual 
studies only a limited number of factors have been evaluated simultaneously. 
Breast cancer is known as a heterogeneous and multifactorial disease, with accumulation 
of (epi)genetic alterations leading to transformation of normal cells into cancer cells. 
With the advent of high-throughput quantification of gene-expression, simultaneous 
assessment of thousands of genes is now possible in a single experiment (Brown et al., 
1999, Nat. Genet. 21:33-7; Holloway, et al., 2002, Gynecol. Oncol, 87:8-16, 2002). 
Gene-expression profiling provides a strategy for discovering gene-expression 
characteristics that may be useful to predict clinical outcome. 

SUMMARY OF THE INVENTION 

Using microarray expression profiling, gene signatures, marker genes, and 
methods were developed for predicting response or resistance to anti-estrogen, for 
example, tamoxiphen therapy and predicting outcome for recurring breast cancer patients. 
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Using a gene profile described herein, analysis of a patient's primary breast tumor against 
the gene profile is predictive of patient response to anti-estrogen, for example, 
tamoxiphen therapy, for example, tamoxifen therapy for the treatment of recurring breast 
cancer. 

Useful gene signatures for predicting outcome (response or resistance) and 
progression free survival in recurring breast cancer treated with anti-estrogen, for 
example, tamoxiphen therapy and include the genes of the 81 -gene signature and the 44- 
gene signature shown in Figure 2. As shown in Figure 2A, a Cluster I expression pattern 
of marker genes correlates with progressive disease; a Cluster II expression pattern of 
marker genes correlates with Objective Respons. In one embodiment, a set of two or 
more marker genes is predictive. The gene signature may comprise at least one, and 
preferably at least two of FN-1, CASP-2, THRAP-2, SIAH-2, DEME-6, TNC, and COX- 
6C. In a specific embodiment, the gene signature comprises at least one of DEME-6 and 
CASP2, and at least one of SIAH-2 and TNC. 

Gene expression levels can be determined using various known methods 
including nucleic acid hybridization in microarrays, nucleic acid amplification methods 
such as quantitative polymerase chain reaction (qPCR), and immunoassay of proteins 
expressed by the genes of the predictive gene profile. Expression levels and expression 
level ratios of two or more genes of the predictive gene profile can be determined, for 
example, using real-time quantitative reverse-transcriptase PCR (qRT-PCR). 

The gene signatures of the invention are useful in assays to predict response 
and/or outcome of anti-estrogen, for example, tamoxiphen therapy for recurring breast 
cancer. In one embodiment, gene expression is analyzed in a primary breast tumor tissue 
sample and compared to the expressed gene signature determined from retrospective 
patient data as described in the Examples below. Sample expression data can be 
analyzed against a classification algorithm determined from a ''training" set of data as 
described in the Examples below. 

In another embodiment, a gene expression ratio of two or more genes, or a 
threshold expression level of one or more predictive genes is analyzed. In a preferred 
embodiment, expression of at least one upregulated gene and at least one down regulated 
gene is analyzed. A ratio of the expression of the upregulated gene to that of the down 
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regulated gene is calculated, where the ratio is predictive of response and/or outcome of 
anti-estrogen, for example, tamoxiphen therapy for treating recurring breast cancer. The 
predictive ratio or ratios may be stored in a database for comparison to the test data. 

The invention includes diagnostic systems and methods such as arrays containing 
one or more probes to detect expression of one or more genes of the predictive profile. 
Preferably, the assay system contains at least one of the genes of the 81 -gene signature or 
of the 44- gene signature shown in Figure 2. In one embodiment, the system contains 
two or more of these genes. The assay system may comprise at least one, and preferably 
at least two of FN-1, CASP-2, THRAP-2, SIAH-2, DEME-6, TNC, and COX-6C. In a 
specific embodiment, the assay system comprises at least one of DEME-6 and CASP2, 
and at least one of SIAH-2 and TNC. 

The gene signatures of the invention are also useful for identifying lead 
compounds useful in the treatment of estrogen-dependent recurring breast cancer. 
Primary estrogen-dependent breast tumor tissue can be contacted with the potential 
therapeutic drug, and the expression of one or more genes of the gene signature analyzed 
and compared with an untreated control. 

These and other features of the invention are described more fully below. 

BRIEF DESCRIPTION OF THE FIGURES 

i 

Figure 1 is a flow chart showing study design and gene selection procedure. 

Figure 2 A and B show a heat map showing clusters of 46 tumors using the 81- 
gene signature. Cluster 1 shows gene expression correlated with progressive disease; 
Cluster 2 shows gene expression correlated with objective response. Genes upregulated 
are shown in red; those downregulated are shown in green. The genes of 8 1 -gene 
signature are listed, and those of the 44-gene signature are indicated by bars at the right 
side of the heat map and also listed. NCBI Accession numbers are shown. Bars side of 
the heat map show genes linked to apoptosis (black), extracellular matrix (purple), and 
immune system (blue). 

Figure 3 shows a series of progression free survival graphs as a function of gene- 
signature classification and traditional factors. Progression free survival curves after start 
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of tamoxifen therapy are shown for the validation set of 66 patients grouped according to 
the traditional factors based score (panels A and B) or the 44-gene signature (panel C). 

Figure 4 is a plot generated with BRB Array Tools showing chromosomal 
distribution of genes of the entire set analyzed (14557 genes, red bars) and those of a 
subset of 6 genes of the signature (blue). 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Definitions 

Gene Signature as used herein, refers to a profile of gene expression that 
correlates with a therapeutic outcome, for example as shown in the heat map of Figure 
2A. 

Cluster I and Cluster II gene profiles are shown in Figure 2A as correlating with 
progressive disease (Cluster I) and objective response (Cluster II). 

Differential expression, as used herein, refers to gene expression in primary 
breast tumor tissues that differs with a patient's outcome in treatment of recurring breast 
cancer with anti-estrogen therapy. 

Objective response as used herein includes complete remission and partial 
remission. 

Outcome as used herein, refers to Response (complete or partial) or Resistance 
(progressive disease or stable disease less than 6 months). 

Recurring disease or recurring breast cancer is used herein to mean cancer that 
develops after the primary breast cancer has been removed, for example metastatic breast 
cancer that occurs after a primar tumor has been excised. 

Stable disease refers to patients with no change in disease status, as well as those 
with no evident tumor reduction of at least 50% or more and those with tumor 
progression. Patients with stable disease are divided into those with no change (stable 
disease) for six months or longer, and those with no change (stable disease) for less than 
six months. 

Tumor progression or Progressive Disease, as used herein, is meant to describe 
growth of about 25% or more tumor mass, or one or more new lesions within a three- 
month period. 
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Because patients with stable disease for 6 months or more exhibited a PFS similar 
to patients with partial remission, these patients were classified as responders to 
tamoxifen as described in the manual for clinical research and treatment in breast cancer 
of the European Organization for Research and Treatment of Cancer. European 
Organization for Research and Treatment of Cancer Breast Cancer Cooperative Group. 
Manual for clinical research and treatment in breast cancer, Almere, The Netherlands: 
Excerpta Medica; 2000. p. 1 16—7. 

Clinical benefit was defined in the studies described herein as objective response, 
including complete and partial remission, and stable disease for six months or more, as 
described in Ravdin et al., 1992, J.Clin. Oncol. 10:1284-91; Foekens^a/., 1994, Br. J. 
Cancer, 70:1217-23; and Robertson^ al, 1997, Eur. J. Cancer, 33:1774-9. 

Only patients with measurable disease were evaluated in these studies, and 
selected patients with no change (stable disease), had received tamoxifen at least for a 
period of 6 months. 

Gene Expression Profiling 

Gene expression profiling of retrospective breast cancer tumor tissue using high 
density cDNA arrays was used herein to generate differential gene expression patterns 
correlated with patient response and outcome data for treatment of recurrent breast cancer 
with anti-estrogen, for example, tamoxiphen therapy. Using tumor RNA obtained from a 
training set of 46 tumors comprising primary tumors from 25 patients exhibiting 
progressive disease after anti-estrogen, for example, tamoxiphen therapy for recurring 
breast cancer and primary tumors from 21 patients exhibiting objective response to anti- 
estrogen, for example, tamoxiphen therapy for recurring breast cancer, differentially 
expressed genes/ests were identified. Using microarray data analysis tools, (BRB Array 
Tools), under a significance level of 0.05, a total of 569 and 449 genes were identified as 
differentially expressed and correlated with progressive disease and objective response, 
respectively. 
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81-gene Signature 

The overlap of these differentially expressed genes identified an initial signature 
set of 81 differentially expressed genes having a pattern correlated with progressive 
disease or objective response for anti-estrogen, for example, tamoxiphen therapy 
treatment of recurring breast cancer. These 81 genes were classified and subjected to 
cluster analysis. The results are shown in the heat map of Figure 2, with a signature 
pattern of gene expression correlated with predictable response and/or outcome. Genes 
that were upregulated in the pattern are indicated in red, while genes that were 
downregulated are shown in green. Gene clustering is also shown by overlapping bars 
shown on the sides of the expression map. 

This 81-gene signature was used to correctly classify retrospective patient 
samples as having a gene expression pattern correlated with progressive disease or with 
objective response to anti-estrogen, for example, tamoxiphen therapy in the treatment of 
metastatic (recurring) breast cancer. 21 of 25 patients with progressive disease and 19 of 
21 patients with objective response were correctly classified by this 81-gene signature, as 
discussed in the Examples below. 

44-gene Signature 

With further analysis and rank ordering of genes on the basis of significance level, 
followed by a step-up calculation of correlation coefficient of expression, a supervised 
learning approach was used to reduce the original 81-gene signature to a smaller 44-gene 
predictive signature having similar accuracy. 

Using a validation set of 66 tumors, the 44-gene signature correctly classified 27 
of 35 patients with progressive disease and 15 of 31 patients with objective response. 
Univariate analysis showed the response predictions by the 44-gene signature to be 
superior to predictions based on the analysis of traditional factors such as menopausal 
status, disease-free interval, first dominant site of relapse, estrogen and progesterone 
receptor status. 

Univariate and multivariate analysis showed the 44-gene signature to be 
predictive of progression free survival, e.g., the time until tumor progression was seen. 
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Individual Signature Genes 

Expression levels of mRNA from individual genes in the 81 -gene signature were 
measured by quantitative PCR as disclosed in the Examples below. The qPCR data was 
correlated with the mircroarray data. Of eight tested genes (CASP2, DLX2, USP9X, 
CHD6, MST4, RABEP, SIAH2, and TNC), Spearman rank correlations were positive for 
all except for MST4. 

Functional Clusters of Signature Genes 

The genes in the 81 -gene predictive signature contained 15 ESTs and 66 known 
genes. See the listing of genes in Figure 2. Functional annotation of these genes showed 
clusters of genes involved with estrogen action, apoptosis, extracellular matrix formation, 
and immune response. Additional genes function in glycolysis, transcription regulation, 
and protease inhibition. 

Seventeen genes were regulated by or associated with estrogen (receptor) action, 
with 9 genes upregulated (LOC51186; TSC22; TIMP3; SPARC; GABARAPL1; CFP1; 
LDHA; EN02; Hs. 99743) and 8 genes downregulated (TXN2; CDC42BP4; HLA-C; 
PSME1; Hs. 437986; SIAH2; UGCG; FMNL) in the primary tumors of tamoxifen- 
resistant patients, as shown in Figure 2. 

Six genes associated with the extracellular matrix (TIMP 3, FN1, LOX, 
COL1 Al, SPARC, AND TNC) and were overexpressed in patients with tamoxifen- 
resistant disease. Another cluster of seven genes were associated with apoptosis (EL4R, 
LDHA, MSP2K4, NPM1, SIAH2, CASP2, and TXN2), while two genes were related to 
anti-apoptosis activities (API 5, BNIP3). Four apoptosis genes were upregulated (API 5, 
NPM1, LDHA, BNIP3), while the other 5 were downregulated in primary tumors of 
patients with tamoxifen-resistant disease. A cluster of 4 genes linked to the immune 
system was downregulated (FCGRT, PSME-1, HLA-C, and NFATC3). 

Chromosome 17 

The 81 -gene signature contains a significant number of genes located on 
chromosome 17, and particularly localized to cytoband 17q21-q22. For example, 5 of 66 
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(6.5%) informative genes (APPBP2, COL1A1, EZH1, KIAA0563, and FMNL) are 
localized to this cytoband, as compared with 199 of 12771 known genes (1 .3%) for the 
entire microarray. 

Tissue Samples 

Breast cancer tissue samples useful for diagnostic assay can be obtained from 
primary tumor tissue, for example, biopsy tissue. In some instances, RNA may be 
obtained from the sample and used directly for analysis of expression. In general, RNA 
extracted from the tissue will be amplified, e.g., by polymerase chain reaction. For 
protein analysis, tissue can be paraffin-embedded and sectioned, for example, for 
immunohistochemstry and in situ hybridization analyses. 

Analysis of Gene Expression 

In one embodiment, primary breast tumor tissue is analyzed for mRNA 
transcripts, for example, by hybridizing to cDNA probes. In another embodiment, the 
tissue is analyzed for protein, for example by immunoassay, for example, immunohisto 
chemistry. Individual genes of the 81 -gene signature are known. NCBI Accession 
Numbers provided in Figure 2 can be used to provide the nucleic acid and polypeptide 
sequences. Appropriate nucleic acid probes for hybridization and/or antibodies for 
immunoassay can be generated using known methods. 

Gene expression in the primary tumor tissue sample is compared with the 
expression pattern of one or more marker genes identified from the 81 -gene signature or 
from genes identified from cluster analysis and association with the genes of the 81 -gene 
signature, as disclosed in the Examples below. 

A nucleic acid marker as used herein is a nucleic acid molecule that, by its 
expression pattern in primary breast tumor tissue, alone or in combination with or 
compared with the expression patterns of one or more additional nucleic acid molecule, 
correlates with response or resistance to anti-estrogen, for example, tamoxiphen therapy 
for recurring breast cancer, or with outcome, such as progressive disease, stable disease, 
or progression- free survival. 
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Hybridization methods useful to analyze gene expression are well known. 
Nucleic acid molecules in the tumor tissue, for example mRNA, can hybridize under 
stringent hybridization conditions with a complementary nucleic acid probe. The nucleic 
acid hybridization probe need not be a full-length molecule, but can be a fragment or 
portion of the a fragment of the full-length cDNA, a variant thereof, a SNP, or iRNA. 
The probe can also be degenerate, or otherwise contain modifications such as nucleic acid 
additions, deletions, and substitutions. What is required is that the probe retain its ability 
to bind or hybridize with the sample nucleic acid molecule, in order to recognize the 
expressed product in the sample. 

Assay Methods 

Marker gene expression can be analyzed by known assay methods, including 
mehtods for detecting expressed nucleic acid molecules, such as RNA and encoded 
polypeptides. Nucleic acid probes and polypeptide binding ligands useful in such 
methods, can be prepared by conventional methods or obtained commercially. Detection 
of expression can be direct or indirect, using know labels and detection methods. 

For analysis of nucleic acid molecules, standard methods, for example, 
microarray technology and qRT-PCR can be used to identify patterns of nucleic acid 
expression in the sample tissue. Methods of microarray technology, including DNA chip 
technology, gene chip technology, solid phase nucleic acid array technology, multiplex 
PCR, nucleic-acid spotted fluidity cards, and the like, are known, and may be used to 
determine the expression patterns of nucleic acid molecules in a patient's tumor sample. 
In one embodiment, array of identified nucleic acid probes is provided on a substrate. In 
a preferred embodiment, the expression of signature genes is assayed by qPCR 
techniques. 

For analysis of expressed polypeptides, known binding assay methods, such as 
immunoassay methods can be used. Examples include imunohitochemistry, ELISA, 
radioimmunoassay, BIACore, and the like. 

EXAMPLES 
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The invention is described herein with reference to the following examples that 
serve to illustrate the embodiments of the invention, and are not intended to limit the 
scope of the invention in any way. - » 

Example 1 

Identification of a Predictive Gene Signature 

The Examples below describe studies undertaken to determine the measurable 
effect of anti-estrogen, for example, tamoxiphen therapy for breast cancer on tumor size 
and on time until tumor progression (progression free survival). The analysis was 
performed on 1 12 estrogen receptor positive primary breast cancer samples from patients 
who developed advanced disease that showed the most pronounced types of response 
(objective response versus progressive disease from the start of treatment). In addition, 
these studies describe underlying gene (signaling) pathways that provide novel potential 
targets for therapeutic intervention. 

METHODS 

Patients and treatment 

The study design was approved by the medical ethical committee of the Erasmus 
MC Rotterdam, the Netherlands (MEC 02.953). To evaluate the predictive value of 
gene-expression profiling in relation to tamoxifen treatment in patients with recurrent 
breast cancer, 1 12 fresh frozen ER-ot-positive (>=10 frnol/mg of protein) primary breast 
tumor tissue specimens of patients with primary operable (invasive) breast cancer 
diagnosed between 1981 and 1992 were included. The median age at time of primary 
surgery (breast conserving lumpectomy, 33 patients; modified mastectomy, 79 patients) 
was 60 years (range, 32-89 years). 

In this retrospective study, all patients were selected for disease recurrence (14 
local or regional relapse, 86 distant metastasis) that was treated with tamoxifen (40 mg 
daily) as first-line treatment. At the start of tamoxifen treatment, the median age was 63 

■ 

years (range 33-90 years), and 27 patients (24%) were premenopausal. None of the 
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patients had received endocrine (neo)adjuvant systemic therapy nor were exposed to any 
hormonal treatment at an earlier stage, i.e. hormo-naive. 

Eighteen patients (16%) received adjuvant chemotherapy. Of these patients, 7 were 
postmenopausal whereas 1 1 were premenopausal at time of surgery. At start of 
tamoxifen monotherapy 8 patients were still premenopausal, whereas 3 patients changed 
to the post-menopausal status before recurrence. Two of these three patients showed 
objective response to tamoxifen. Therefore, chemocastration as prior endocrine therapy 
could not have had a significant impact on the results. 

The median follow-up of patients alive was 94 months (range, 21-165 months) 
from primary surgery, and 53 months (range, 2-131) from the start of tamoxifen 
treatment. Tumor progression after tamoxifen occurred in 103 (92%) of the patients. 
During follow-up, 94 patients (84%) died. After tumor progression on first-line 
tamoxifen treatment 69 patients were treated with one or more additional endocrine 
agents, while 64 patients were subsequently treated with one or more regimens of 
chemotherapy such as cyclophosphamide methotrexate 5-Fluorouracil (CMF) or 5- 
fluorouracil, adriamycin, cyclophosphamide (FAC) after the occurrence of hormonal 
resistance. 

Criteria for follow-up, type of response, response to therapy was defined by 
standard UICC criteria (Hayward, et al., 1977, Cancer, 39:1289-94), and for progression 
free survival Were described previously (Foekens, et al., 2001, Cancer Res., 61:5407-14). 
Complete and partial response (CR and PR) was observed in 12 and 40 patients, 
respectively, resulting in 52 patients with an objective response (OR); progressive disease 
(PD) within 3-6 months from start of treatment was observed in 60 patients. Median 
progression free survival -time of objective response was 17 months, whereas the median 
progression free survival-time of patients with progressive disease was 3 months. 

RNA isolation, amplification and labeling 

Total RNA was isolated from 30 fim frozen sections (approximately 20-50 mg 
tumor tissue) with RNABee (Campro Scientific). The percentage of tumor cells was 
determined in two Haematoxylin eosin stained frozen 5 /im sections that were cut before 
and after sectioning for RNA isolation. The tumor samples had a median tumor content 
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of 65%. A T7dT oligo primer was used to synthesize double-stranded cDNA from 3 jig 
total RNA and subsequently to generate aRNA by in vitro transcription with T7 RNA 
polymerase (T7 MEGAscript™ High Yield Transcription kit, Ambion Ltd., Huntingdon, 
UK). Two micrograms of aRNA was labeled with Cy3 or Cy5 (CyDye, Amersham 
Biosciences) in a reverse transcription reaction. The labeled cDNA probes were purified 
using Qiagen PCR clean up columns (Qiagen, Westburg BV, Leusden, The Netherlands). 

Similar to the Stanford protocol, a cell line pool of 13 cell lines derived from 
different tissue origins was used as reference for all microarray hybridizations (details are 
available at MIAMExpress (http://www.ebi.ac.uk/miamexpress /). Probes of the cell line 
pool were always labeled with Cy5. 

Quantitative real-time PCR 

Total RNA isolated for the microarray analysis was used to verify the quantity of 
specific messengers by real-time PCR. The RNA was reverse-transcribed and real-time 
PCR products were generated in 35 cycles from 15 ng cDNA in an ABI Prism 7700 
apparatus (Applied Biosystems, Foster City, USA) in a mixture containing SYBR-green 
(Applied Biosystems, Stratagene) and 330 nM primers for differentially expressed genes 
(i.e. CASP2, DLX2, EZH1, CHD6, MST4, RABEP, SIAH2, and TNC). SYBR-green 
fluorescent signals were used to generate Cycle threshold (Ct) values from which mRNA 
ratios were calculated when normalized against the average of three housekeeping genes, 
i.e. hypoxanthine- guanine phospho-ribosyltransferase (HPRT), porphobilinogen 
deaminase (PBGD), and /?-2-microglobulin (B2M) (Martens, et al., 2003, Tliromb. 
Haemost. , 89:393-404). 

cDNA microarrays: preparation, hybridization, and data acquisition 

Microarray slides were manufactured at the Central Microarray Facility at the 
Netherlands Cancer Institute (Weige, et al., 2003, Proc. Natl Acad. Set U.S.A., 
100:15901-5). Sequence-verified clones obtained from Research Genetics (Huntsville, 
AL) were spotted with a complexity of 19,200 spots per glass slide using the Microgrid II 
arrayer (Biorobotic, Cambridge, U.K.) The gene ID list can be found at 
http://microaiTavs.nki.nl . Labeled cDNA probes were heated at 95°C for 2 minutes and 
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added to preheated hybridization buffer (Slide hybrization buffer 1, Ambion). The probe 
mixture was hybridized to cDNA micro arrays for 16 hours at 45 °C. 

Fluorescent images of microarrays were obtained by using the GeneTAC™ LS II 
microarray scanner (Genomic Solutions; Perkin Elmer). IMAGENE v5.5 (Biodiscovery, 
Marina Del Rey, CA) was used to quantify and correct Cy3 and Cy5 intensities for 
background noise. Spot quality was assessed with the flagging tool of IMAGENE, in this 
study set at R>2 for both Cy3 and Cy5. Fluorescent intensities of each microarray were 
normalized per subgrid using the NKI MicroArray Normalization Tools 
(http : //dexter. nki . nD to adjust for a variety of biases that affect intensity measurements 
(e.g. color-, print tips, local background bias) (Y ang, et al., 2002, Nucleic Acids Res., 
30:el5). All ratios were log2 transformed. 

Data analysis and statistics 

Microarray data analyses were performed with the software packages BRB Array 
Tools, developed by the Biometric Research Branch of the US National Cancer Institute, 
(http://linus.nci.nih.gov/BRB-ArrayTools.html), and Spotfire (www.spotfire.com, 
Goteborg, Sweden and Sommerville, MA). BRB was implemented for statistical analysis 
of microarray data whereas Spotfire was used for cluster analysis. The class comparison 
tool in BRB combines a univariate F-test and permutation test (n=2000) in order to find 
discriminating genes and to confirm their statistical significance. In the class comparison 
a significance level of 0.05 was chosen in order to limit the number of false negatives. 

Spotfire was used to perform hierarchical clustering. To analyze microarray data 
from different batches of slides, genes were Z-score normalized per batch. The Z-score 
was defined as [value - mean]/SD. After normalization, microarray data were clustered 
via complete linkage. The similarity measure for clustering was based on cosine 
correlation and average value. 

Sensitivity, specificity, positive and negative predictive value (PPV and NPV, 
respectively) and odds ratios (OR) were calculated and presented with their 95% 
confidence interval (CI). The data are shown in Table 2. The performance of the 
signature in the validation set was determined via the likelihood ratio of the Chi square 
test. A supervised learning approach was applied to reduce the 8 1 differentially 
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expressed genes to a smaller 44-gene predictive signature. First, all 8 1 genes were rank 
ordered on the basis of their significance as calculated with the BRB class comparison 
tool. Next, starting with the most significant gene, the Pearson correlation coefficient of 
expression with the other 80 genes was calculated. Succeeding genes were excluded 
from the signature as long as their expression correlated significantly (P<0.05) with the 
most significant gene. The first gene of the 81 gene profile that did not correlate with 
expression of the most significant gene was added to the final signature, and the whole 
procedure of expression correlation analysis with this second gene was repeated with the 
remaining less significant genes. In this way, genes with overlap in their expression were 
removed and the 44-gene predictive signature was derived. 

The predictive score for the traditional-based model included menopausal status, 
disease free interval (DFI>12 months versus DFI<12 months after primary surgery), 
dominant site of relapse (relapse to viscera or bone versus relapse to soft tissue), log 
estrogen receptor (ER) and log progesterone receptor (PgR) levels. In the analyses of 
progression free survival, the Cox proportional hazards model was used to calculate the 
hazard ratios (HRs) and 95% CI. Survival curves were generated using the method of 
Kaplan and Meier (1958, J. Am, Stat Assoc., 53:457-481) and a log rank test for trend 
was used to test for differences. Correlation between microarray data and real-time PCR 
data was determined with Spearman rank correlation test. Computations were performed 
with the STATA statistical package, release 7.0 (STATA Corp., College Station, TX). 
All p-values are two-sided. 

Method of classification 

For the validation of the 44-gene signature, a classification algorithm (Gene 
Prediction Tool (GPT)) was developed that is comparable to the Compound Covariate 
Predictor (CCP) from BRB Array Tools. In detail, GPT applies two cut-off values 
instead of the midpoint used in the CCP tool for classification. The two thresholds are 
the median values of progressive disease and objective response and are defined in the 
tumors of the training set. 

To obtain a robust classification algorithm, genes from the signature only become 
classifiers whenever the expression values are outside the two thresholds and as a result 
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mainly represent one class, either progressive disease or objective response. When the 
expression level falls between the two cut-off values, the gene is excluded as classifier 
because the value can represent both response classes, i.e. progressive disease but also 
objective response. The gene classifiers from the predictive 44-gene signature are 
identified for each tumor from the validation set using the algorithms described herein. 
Finally, the ratio between the identified response predicting genes and resistance 
predicting genes determines the predicted signature-based response outcome. 

MATHEMATICAL ALGORITHM FOR GENE PREDICTION TOOL 

Threshold Objective Response for gene x: Kx-MEDIAN (Mx:AKx) 

Threshold Progressive disease for gene x: Jx=MEDIAN(ALx:BFx) 

Classification Constant for gene x: Lx—IF(Kx> ==Jx, 1, -J) 

Constant for gene x: either 1 for response or -1 for resistance 

Gene x Tumor Classification: My^$Lx^IF(Mx>MAX($Jx t $Kx)J t IF(MTN($Jx t $Kx) t -l i 0)) 

Tumor classification for gene x: either 1, or 0 (= not informative) 

* Operation as performed on Excel spreadsheet 

RESULTS 

Selection of differentially expressed genes and predictive signature 

To select discriminatory genes for the type of response to tamoxifen, a training set 
of 46 tumors was defined that comprised primary tumors of 25 patients with progressive 
disease (PD) and of 21 patients with objective response (OR, see Figure 1). The tumor 
RNAs of this training set were hybridized, in duplicate, and genes/ESTs that had less than 
90% present calls over the experiments were eliminated. This resulted in 8555 and 7087 
evaluable spots, respectively. Using a significance level of 0.05 in the BRB class 
comparison tool, 569 and 449 genes, respectively, were differentially expressed between 
the progressive disease and objective response subsets. The overlap, i.e. 81 genes, was 
designated as the differentially expressed signature. 

After supervised hierarchical clustering (shown in Figure 2), this discriminatory 
signature correctly classified 21 of 25 patients with progressive disease (84% sensitivity; 
95% CI: 0.63-0.95) and 19 of 21 patients with objective response (91% specificity, 95% 
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CI 0.68-0.98) with an odds-ratio of 49.8 (pO.OOOl). The positive predictive value and 

* 

negative predictive value for resistance to tamoxifen were 91% and 83%, respectively. 
Further analysis, rank-ordering of genes on the basis of significance level, followed by a 
step-up calculation of correlation coefficient of expression, reduced the initial set of 81 
genes to a smaller 44-gene predictive signature with similar accuracy. 

Example 2 

Validation of Predictive Gene Signature: 
Correlation to Clinical Response and Time to Treatment Failure 

Type of Response 

In a validation set of 66 tumors, the predictive 44-gene signature correctly 
classified 27 of 35 patients with progressive disease (77% sensitivity, 95% CI: 0.59-0.89) 
and 15 of 31 patients with objective response (48% specificity, 95% CI: 0.31-0.67) with 
an odds ratio of 3.16 (95% CI: 1.10-9.1 1, p=0.03). In univariate analysis for response, 
the predictive signature appeared to be superior, i.e. more than 2-fold higher odds ratio, to 
most traditional factors (i.e. menopausal status, disease-free interval, first dominant site 
of relapse, estrogen and progesterone receptor status), of which only estrogen receptor- 
level (odds ratio, 1.54; 95% CI: 1.00-2.40; p= 0.05) and progesterone receptor-level 
(odds ratio, 1.37; 95% CI: 1.05-1.79; p= 0.02) showed significant predictive value. In 
multivariate analysis for response, the signature-based classification did not significantly 
(increase in X 2 = 1.45) add to the traditional based-factor score (data not shown). 

Progression Free Survival 

In addition, in univariate analysis only the 44-gene signature (hazard ratio, 0.54 
[95% CI: 0.31-0.94]; p=0.03) and progesterone receptor-level (hazard ratio, 0.83 [95% 
CI: 0.73-0.96]; p=0.01) were significantly correlated with a longer time until tumor 
progression and this was retained for the signature in the multivariable analysis (hazard 
ratio, 0.48 [95% CI: 0.26-0.91]; p=0.03). Progesterone receptor is also independent, but 
with a less striking hazard ratio (0.82 [95% CI: 0.71-0.94]; pO.Ol). After addition of the 
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* * 2 

signature-based classification to the traditional-factors-based score the increase in X was 
5.18 (dff=l, p=0.02), indicating that the predictive signature independently contributed to 
the traditional predictive factors for progression free survival, hi Kaplan-Meier analyses, 
the median difference in progression free survival time for patients with a favorable and 
poor response was 2-fold longer when the 44-gene signature (Figure 3c) was used in 
comparison to the traditional factors-based score without (Figure 3a) and with PgR 

* 

(Figure 3b) (i.e. 11 months versus 5 months, see Figure 3). 
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Table 1 

Univariable and multivariable analysis for PFS after start of tamoxifen treatment In 
the validation set of 66 patients with advanced breast cancer. 







Univariable (N=66) 






Multivariable (N-66) 




Traditional factors: 


HR 


[95% CI] 


P 


HR 


[95% CI] 


P 


Menopausal status 3 


1.07 


[0.57-2.00] 


0.83 


1.16 


[0.58-2.33] 


0.67 


Dominant site of relapse: 














Bone to soft tissue 


j 1 .56 


[0.70-3.47] 


0.28 


1.80 


[0.76-4.26] 


0.19 


Viscera to soft tissue 


1.26 


[0.47-2.79] 


0.57 


1.42 


[0.60-3.34] 


0.42 


Disease Free Interval b 


0.92 


[0.53-1.57] 


0.75 


1.08 


[0.61-1.90] 


0.80 


Log ER 


0.83 


[0.66-1.06] 


0.13 


0.88 


[0.68-1.14] 


0.33 


Log PgR 


0.83 


[0.73-0.96] 


0.01 


-.0.82 


[0.71-0.94] 


0.01 


Micro array 


44-gene signature c 


0.54 


[0.31-0.94] 


0.03 


0.48 


[0.26-0.91] 


0.03 



a) Menopausal status: post- vs premenopausal; 

b) DFI: > 12 months vs < 12 months; 

c) 44-gene signature: sensitive vs resistant. 



Example 3 

Independent confirmation of gene-expression 

The mRNA expression levels of 8 genes of the 81 -gene signature were analyzed 
by quantitative real-time PCR. The genes included: CASP2, DLX2, USP9X, CHD6, 
MST4, RABEP, SIAH2, and TNC. qPCR data was correlated with microarray data. 
Spearman rank correlations were positive for all genes but MST4. 
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Example 4 

Functional Analysis of the 81-Gene Predictive Signature 

The 81 -gene signature described above in Example 1 was analyzed for the 
functional aspects of the genes contained in the signature. The genes were examined for 
functional relationships using Ingenuity Pathway Analysis tools. (Mountain View, CA) 

The signature contains 15 ESTs and 66 known genes (see Figure 2). Functional 
annotation of the genes in the signature showed genes involved in estrogen action (26%), 
apoptosis (14%), extracellular matrix formation (9%), and immune response (6%). The 
remaining genes function in glycolysis, transcription regulation, and protease inhibition. 

The patterns of expression of many genes that are associated with anti-estrogen, 
for example, tamoxiphen resistance and sensitivity are highly complex. The 81 
differentially expressed genes includes, as expected, genes regulated by or associated 
with estrogen (receptor) action (van 't Veer, et al., 2002, Nature, 415:530-6; Tang, et al., 
2004, Nucleic Acids Res., 32 Database issue:D533-6; Pusztai, et al., 2003, Clin. Cancer 
Res., 9:2406-15; Gruvberger, et al., 2001, Cancer Res., 61:5979-84; Charpentier et al., 
2000, Cancer Res., 60:5977-83; Frasor, et al. 2003, Endocrinology, 144:4562-74), but 

also genes involved in extracellular matrix formation and apoptosis. 

■'■ -- . . •** 

- v Seventeen genes were regulated by or associated with estrogen (receptor) action, 
of which 9 genes showed upregulation (LOC51186; TSC22; T1MP3; SPARC; 
GABARAPL1; CFP1; LDHA; EN02; Hs. 99743) and 8 genes downregulation (TXN2; 
CDC42BP4; HLA-C; PSME1; Hs. 437986; SIAH2; UGCG; FMNL) in the primary 
tumors of patients who were resistant to tamoxifen therapy for recurring breast cancer 
(see Figure 2). Several of these estrogen (co-)regulated genes (LDHA, TXN2, and 
SIAH2) have been linked to apoptosis. 

A cluster of 6 genes was identified as associated with the extracellular matrix 
(ECM). These genes, TIMP3, FN1, LOX, COL1 Al, SPARC, and TNC were 
overexpressed in the primary tumors of patients that demonstrated resistance to anti- 
estrogen, for example, tamoxiphen therapy for treatment of recurring breast cancer 
(progressive disease). 
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Besides cytostatic effects, the anti-estrogen tamoxifen is known to have cytolytic 
effects by induction of apoptosis, as reviewed by Mandlekar and Kong (Mandlekar, et al. 
2001, Apoptosis, 6:469-77). Based on Swiss prot, PubMed, and Ingenuity analysis 
information, nine genes (LOC51186; TSC22; TTMP3; SPARC; GABARAPL1; CFP1; 
LDHA; EN02; Hs. 99743) in the 81 -gene signature are related to programmed cell death, 
of which three genes inhibit apoptosis (API5, NPM1, and TXN2) whereas three other 
genes induced apoptosis (CASP2, MAP2K4, and SIAH2). Interestingly, the two latter 
genes (MAP2K4, SIAH2) induce the apoptotic machinery of fibroblasts. 

Seven genes of the Signature were associated with apoptosis (IL4R, LDHA, 
MAP2K4, NPM1, SIAH2, CASP2, and TXN2), whereas two other signature genes 
(API5, BNIP3) were related to anti-apoptosis processes. 

In general, the expression patterns indicate that anti-estrogen, for example, 
tamoxiphen resistance is mainly associated with inhibition of apoptosis. Interestingly, 4 
apoptosis genes (APIS, NPM1, LDHA, and BNIP3) were upregulated and 5 genes (IL4R, 
MAP2K4, SIAH2, CASP2, and TXN2) were downregulated in primary tumors of 
patients that were resistant to anti-estrogen, for example, tamoxiphen therapy for 
treatment of recurring breast cancer (progressive disease). 

Example 5 
Specifc Set of Useful Marker Genes 

Ten genes selected from the 81 -gene signature (CHD6, FN1, TNC, CASP2, 
EZH1, RABEP1, THRAP2, SIAH2, DEME-6, COX6C) were analyzed to date against 
272 tumors. Uni- and multivariable analyses were performed to determine the response 
and duration of response (progression free survival), using the methods described above. 
In multivariable analysis the individual genes were compared with the clinically used 
model of traditional predictive factors (i.e., menopausal status, disease-free survival, 
dominant site of relapse, log ER, and log PR). 

Specific calculation of threshold values (cutpoints) (Table 3) for prediction of 
Overall Response and Progression Free Survival were calculated as described above. As 
shown in Tables 3-6, marker genes DEME-6, CASP2, and SIAH2 were useful as 
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individual markers of clinical outcome. Reliability of the prediction increased with 
combinations of the markers (see Tables 5 and 6). 

None of the ECM genes in the signature is found in the 70- gene classifier for poor 
prognosis of node negative breast cancer patients by van c t Veer et al. (van 't Veer, et al., 
2002, Nature, 415:530-6), suggesting that the ECM gene cluster is specific for the 
prediction of tamoxifen resistance. Furthermore, SPARC/osteonectin, a myoepithelial 
cell marker that is estrogen (co-)regulated, was recently described as an independent 
marker of poor prognosis in unselected breast cancers (Iacobuzio-Donahue, et al., 2002, 
Cancer Res., 62:5351-7; Mackay, et al, 2003, Oncogene, 22:2680-8; Jones, et al., 2004, 
Cancer Res., 64:3037-3045). In addition, a new cluster of genes linked to the immune 
system (FCGRT, PSME1, HLA-C, and NFATC3) was downregulated in the patients with 
progressive disease compared to those with objective response. 

The 81 -gene signature showed an overrepresentation of genes located to 
chromosome 17, but an under representation of genes located to chromosomes 4, 15, 18 
and 21 (Figure 4). Genes localized to cytoband 17q21-q22 appeared to be significantly 
(p=0.03) overrepresented, i.e. 5 of 66 informative genes (i.e. APPBP2, COL1A1, EZH1, 
KIAA0563 and FMNL) in the signature (6.5%) compared to 199 of 12771 known genes 
" (1 .5%) for the whole microarray. 

DISCUSSION 

The studies described above in Examples 1-4 demonstrate that expression array 
technology can be effectively and reproducibly used to classify primary breast cancer 
tumors according to a predicted resistance or sensitivity to anti-estrogen, for example, 
tamoxifen treatment for recurring breast cancer. An 81 -gene signature with multiple 
individual genes predictive of response and outcome, alone or in combination with other 
genes is described and validated. A 44-gene signature is described that predicted anti- 
estrogen, for example, tamoxiphen therapy outcome in 1 12 breast cancer patients with 
ER positive recurrent disease. Overall, a prediction of anti-estrogen, for example, 
tamoxiphen resistance was accomplished with an accuracy of 80%. Moreover, the 44- 
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gene signature predicted a significantly longer progression free survival time that is 
superior to the prediction obtained by a traditional factors-based score. Differences in 
RNA expression were confirmed by quantitative real-time PCR. 

The predictive value of the 44-gene signature compares favorably and contributes 
independently with that of traditional prognostic factors, including the estrogen receptor, 
currently the validated factor for response prediction to hormonal therapy in breast 
cancer. The estrogen receptor, present in about 70-75% of breast cancers, correctly 
predicts response to tamoxifen in about 50-60% of the patients (Osborne, 1998, N. Engl. 
J. Med. 339:1609-18), while the gene signature predicts resistance to tamoxifen in 77% 
of the patients in the validation set. 

The present 44-gene signature, due to its significant association with time to 
treatment failure, may be used to classify patients based on time to treatment failure. 

In general, the arrays used in these different studies comprise different 
genes/ESTs than those disclosed in the prior art. Of these arrays, approximately half of 
the genes show overlap. This could result in few overlapping genes in the generated 
gene-signatures. Therefore, comparison of pathways based on the extracted gene 
signatures from different studies could be more informative. . 
At present, none of these differentially expressed genes that are regulated by or 
associated with estrogen (receptor) action have been directly linked by others with 
endocrine resistance in clinical samples. The data described herein provides a better 
understanding of endocrine resistance and provides novel potential therapeutic targets for 
individualized treatment. 

A diagnostic assay was recently developed by Genomic Health, the Oncotype DX 
diagnostic assay based on a candidate gene selection (not genome wide) approach. This 
test provides a recurrence score for lymph node negative breast cancer patients with 
estrogen receptor positive tumors that have received adjuvant tamoxifen (Paik, et al., 
2003, Breast Cancer Res. Treat., 82:S10). Their multiplex 21 -gene test includes genes 
associated with proliferation, estrogen and HER2 action, invasion and 5 control genes. 
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None of the genes however, overlap with the 81 -gene signature that was selected through 
microarray based gene expression profiling. 

Recently, Sgroi et al. (Ma, et al., 2004, Cancer Cell, 5:607-16) also analyzed 
tumors from patients with adjuvant tamoxifen therapy using microarray analyses. They 
extracted a two-gene ratio that predicts "a tumor's response to tamoxifen or its intrinsic 
aggressiveness, or both". Interestingly Sgroi et al. (Ma, et al., 2004, Cancer Cell, 5:607- 
16) showed that HOXB13, located to 17q21 was overexpressed in tamoxifen resistant 
cases with recurrence after adjuvant tamoxifen. In the 81 -gene signature, we observed 5 
genes located to chromosome 17q21-22 that could be of importance for tamoxifen 
resistance. In this region, the signature gene COL1 Al was discriminative and highly 
expressed in the signature. Moreover, HOXEJ13, like COL1A1, is not positioned in the 
17q21 HER2/ERBB2 amplicon (Hyman, et al., 2002, Cancer Res., 62:6240-5) but in the 
second of three regions (i.e. 17ql2 -HER2-, 17q21.2 -HOXB2-7-, 17q23 -PPM1D-) 
highly amplified in breast cancer. This implies that genes other than those of the ERBB2 
amplicon region, like HOXB13 and COL1 Al are important for resistance to tamoxifen 
and present potential therapeutic targets. 

The expression of the other 4 signature genes located to chromosome 17q does 
not correlate with the ERBB2 expression, since they (EZH1, FMNL, KIAA0563, and 
APPBP2) were down regulated in the tamoxifen resistant tumors. This region has been 
implicated for LOH in 30% of breast cancer cases (Osborne, et al., 2000, Cancer Res., 
60:3706-12). Only recently, JlJP/plakoglobin/gamma-catenin was identified as a LOH , 
whereas LOH of BRCA1 is frequently observed in high-grade tumors (Ding, et al., 2004, 
Br. J. Cancer, 90:1995-2001). The signature gene EZH1 located between JUP and 
BRCA1 may, therefore, be another LOH candidate gene. 

Numerous reports have described that ERBB2 amplification and over-expression 
in ER positive patients is associated with a reduction in response rate to first-line 
hormone therapy (Lipton , et al., 2003, X Clin. Oncol., 21:1967-72; Ferrero-Pous, et al., 
2000, Clin. Cancer Res., 6:4745-54; Wright, et al., 1989, Cancer Res., 49:2087-90). 
Since the expression patterns of the 5 signature genes on 17q21-q22 are not significantly 
correlated with ERBB2 expression in this array study, this indicates that another, yet 
unknown, mechanism may be activated. 
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An 81 -gene signature of differentially expressed genes and a 44-gene signature 
that predicts anti-estrogen, for example, tamoxifen therapy resistance and time to 
progression in ER-positive breast cancer patients with recurrent disease have been 
developed. The gene signatures demonstrate a significantly better performance than the 
commonly used traditional clinical predictive factors in uni- and multivariate analyses, 
and (3). In contrast to the traditional factors site of relapse and disease free interval 
(DFI)_, the prediction of response can be derived from the gene-expression profile of 
primary tumors. 

Objective Response, Stable Disease, and Progressive Disease 

The 81 -gene signature was validated with quantitative PCR analysis (as described 
above) on RNA obtained from a larger series of 272 tumors from breast cancer patients 
who underwent first-line tamoxifen therapy for advanced disease. Included were patients 
having stable disease. Of these, 59 showed an objective response, 120 had stable disease, 
and 93 had progressive disease. 

Ten genes selected from the 81-gene signature (CHD6, FN1, TNC, CASP2, 
EZH1, RABEP1 , THRAP2, SIAH2, DEME-6, COX6C) have been analyzed to date 
against all 272 tumors. Uni- and multivariable analyses have been performed to 
determine the response and duration of response (progression free survival). In 
multivariable analysis the individual genes were compared with the clinically used model 
of traditional predictive factors (i.e., menopausal status, disease-free survival, dominant 
site of relapse, log ER, and log PR). 

Clinical implications for patients predicted to have a poor response to tamoxifen 
therapy are that these patients should be candidates for other treatments or novel 
therapies, based on different targets present in their tumor profiles. This will reduce the 
use of ineffective treatments. 



24 



WO 2005/054510 PCT/iB 2 004/004 405 




2 



to 

CO 
CD 
O 
O 



| 

Xv 

co 

M 
CO 

co 



X 

CO 

o 
co 

CO 



CO 



CO 

co 
-o 



X 

co 

CO 

co 
to 



-4 

co 

CO 



8 

CO 

o 

CO 



X 

to 

ro 

o 
x» 



H 

O) 
Co 
CD 

ro 
cn 



X 

to 

cn 
-o 

-4 

CO 

o 



cn 

CD 

cn 



Co 



I 

4v 

2 



X 

to 

Co 

4V 

o 

CO 



CD 
CO 
CO 



5 

CO 

"O 
CO 
00 
CO 



X 

to 

to 
**4 
-»4 

--4 



CD 
cn 
co 



<o 

to 
Xv 
cn 



o> 
o> 
to 
co 

cn 



X 
to 

to 
cn 

co 
o 



cn 
g 



CD 
O 



to 
cn 
-4 



X 

to 

t» 
co 

CD 

o 



CO 



Co 
— k 

CO 
CO 



I 

a> 
co 
to 

CD 
CO 



X 

to 

CD 
CO 
Co 

XV 

CD 



CO 
CO 



-si 

ro 

-4 



cn 
o 

CO 



cn 
o 



o 



X 

CO 

to 
to 
-o 

4X 
CO 



to 
cn 



to 

OO 
CO 
CO 



M 
CD 
CD 

-4 

cn 



o 
oo 
co 
cn 
to 



X 

to 



cn 

CD 

o 



x* 
co 
o 
to 



CO 

cn 



X 
to 

-j 

CO 

CO 

cn 



2 



X 

to 

•u 
o 

cn 
cn 



to 

ro 



co 
ro 
to 



cn 



3 

CO 

-J 
cn 



X 

to 

* 

CO 
CO 

2 



CO 



X 

to 

CO 
CO 

cn 

CO 



<o 
o 



"Si 

o 

CO 

o 



co 
it 

CO 
CO 



> 

o 

CD 
CO 
IO 



X 

to 

o 

CO 

cn 
to 
o 



3 



I 

ro 

cn 

CD 

cn 
o 



X 
to 

fO 
O 

2 



00 



CO 

o 

CD 



X 

to 

ro 
-o 

CO 

cn 



ro 

-4 



-4 

CO 

Co 



o 

CD 
CO 
CD 



I 

CO 



CD 
-4 



X 

to 

ro 

CD 
-si 
CO 

o 



to 
cn 



co 
ro 

CO 



cn 
co 

CO 

x^ 
ro 
cn 



X 

to 

to 
o 

XV 



ro 

cn 



o 
to 
cn 



CO 
CO 

ro 
ro 



cn 

CO 
CD 
-O 

cn 

CD 



X 
to 

CO 
cn 
cn 
-J 

co 



X 

■O, 
ro 
cn 

CD 

CO 



X 

to 

CO 
CD 

ro 
o 



ro 

co 



co 
ro 

Xv 
CD 



ro 
to 

XV 

O) 
X* 



$ 

X*- 
OJ 
CO 

cn 
co 
co 



X 

to 

o> 
cn 
to 
to 

CD 



X 

to 

CO 
CO 

2 

ro 

CD 



CO 



X 

to 

Jv 

it 
-4 



ro 



Xv 

cn 
cn 
co 



to 
cn 

CO 

cn 

CO 



X 

to 

ro 

cn 
-o 

CO 



to 



CO 

cn 
to 



ro 
cn 

CO 

cn 
co 



to 

CO 
CO 
CO 
CO 
CO 



X 

to 

-4 

CO 

cn 



CO 



cn 

cn 
en 



1 

Xv 
CO 
CD 
CO 

to 



CO 
CO 

Co 



CO 
-4 

cn 

CO 



cn 

•tv 

tO 



to 

CO 
CO 



X 

CO 

ro 

co 
to 

4v 



cn 



x» 

o 

en 



ro 

CD 
CO 

-4 
ro 



CO 
-O 
CD 



CO 
CO 



ro 

CO 

cn 
cn 
to 
co 



X 

to 

X* 
Xv 

o 
cn 
oo 



Co 



CD 

ro 
to 
o 



ro 
-o 

CD 
Xv 



cn 
to 

CD 

to 
cn 
cn 



X 

to 

ro 

CO 

to 



ro 



cn 

CD 



X 

CO 

ro 
ro 
to 
cn 

CO 



ro 
ro 
co 
co 



co 
cn 
cn 
x* 



cn 
cn 
cn 
cn 
to 



X 

to 

co 
co 



CD 



ro 
cn 

-4 

co 



ro 

CD 

co 

CO 
CO 



70 

4V 

CO 
CO 



00 



-0 
Xv 

CO 

cn 



X 

in 

-jk 
o 
ro 
ro 
cn 



CO 

cn 
o 
© 



CO 

o 



73 

CO 

ro 
cn 

ro 



X 

co 

ro 

CO 
-4 
CO 

ro 
o 



CO 



ro 
cn 

CO 

ro 



ro 

-4 
CO 

-o 

Co 



-si 

to 
ro 
o 
ro 



X 

to 

ro 

Xv 

cn 

CO 
CD 



cn 



cn 
cn 



ro 

to 

CO 
CO 

co 



X 

to 

CO 

to 
cn 



cn 
cn 
cn 
cn 



co 



Xv 
CO 

CO 



CO 
CO 
CO 
CO 

o 



CO 

cn 
to 

ro 
o 



X 

to 

4v 

cn 
ro 
O 

-4 



Ki 



ro 
cn 

CO 



o 
cn 

--4 
CO 



X 
CO 

cn 

CO 

cn 

Xv 



X 

to 

cn 

to 

CD 
Xv 



O 

c 

to 



o 



(0 
O 



PC 



> 

n 
p 

o 
o 
a. 



C 

3 

CD* 

3 



2 

CO 

s 



TJ 
XI 



—J 

o 

ro 



2S 

5 



TI 
> 



m 

CD 



m 

O 
ro 



O 
CO 



cn 

CO 



TJ 

CO 
> 



o 
> 



m 



it 



O 
T) 



O 
-n 
tj 



1 

■ 

73 
CO 
TJ 



> 

CO 

> 

7) 

> 



-< 

o 

CD 



CD 



3 



CO 
O 



o 
o 



o 

g 



m 

co 

co 



O 



•o 

-4 

CO 



O 
O 



CO 

o 

ro 
ro 



O 
X 

o 

CO 

i 

■o 
to 
z> 
CP- 
s' 
to 



O 
O 
cn 



O 

o 

3 

CD 

I 

CO 
«< 

3 

CT 

O 



-4 
.O 

43 

CO 

cn 



43 

ro 

CO 



ro 

43 

ro 

CO 
CO 



Xv 

ro 
— i 

CO 



ro 

CO 



-4 
43 

ro 

X 

ro 

co 



co 

T3 

ro 

ro 
< 

ro 



co 
■o 
ro 

Lo 



to 
to 



ro 

TD 

— k 

CO 



ro 

CO 



CO 

ro 

CO 
CO 



o 

X3 

ro 
*-* 

x4> 
ro 
ro 



CO 

cn 
ro 



•o 

CO 
CO 

ro 
co 



a 
ro 

Xv 

X3 

ro 
cn 



-o 
ro 



ro 



T3 

cn 

Xv 



ro 



T3 

ro 



cn 

43 
CO 

cn 



o 



ro 



■a 

co 

— 

T3 

ro 

ro 



ro 
■o 

CO 
CO 



CO 
CO 



ro 

43 

ro 

43 
CO 
Xv 



ro 



o 

XD 

ro 

co 

co 



CO 
CO 

to 



43 

ro 



to 
ro 

ro 



-4 

ro 

cn 

fo 



to 

XD 

CO 
CO 



ro 

CO 
CO 



cn 

CO 

CO 

4i 
CO 

ro 



•o 
x> 
ro 

co 
fo 



cn 

ro 
co 

ro 



ro 

X3 
CO 
4v 



fo 

XD 

ro 

co 



CO 
X3 



ro 

o 

43 

ro 



X 

JD 

ro 
ro 



ro 
ro 



O 

o 

5' 
3 



CO 



CO 



it 



cn 



00 



CO 



CO 

CO 



CO 



to 



CO 



in 

3 




ro 
co 

4> 



> 
TJ 
T3 
CO 
TJ 
ro 



CD 
> 



m 
co 



71 
CD 
TJ 



> 

TJ 



o 

CD 



CD 

m 

2 



O 

X 

> 



> 



2 

TI 



1 

I 

CO 
TJ 



> 
CO 
> 

> 
TJ 



o 

CD 



CO 



to 

T» 

cn 
cn 

Xv 

O 
o 

CO 



O 

X 



Co 
O 
ro 
ro 



< 

to 



CD 



< 

51 
to 

CD 



to 
tn 



£ 

to 

CD 



< 

01 



< 

to 

CO 
Xv 



O 

o 

cn 



CO 

cn 



CQ 

<t> 

3 

CD 



< 

to 

cn 



n 

TJ 

O 



ro 
co 
cn 



o 

CO 
— *. 

co 



o 

• 

o 
ro 
co 



4v 

o 



o 

CO 
cn 



cn 
cn 



ro 

Oi 
CD 



o 

CD 

ro 



o 

co 



CO 
CO 



ro 

Xv 



CD 



ro 
co 



o 
o 

—V 

CD 



O 

ro 



ro 

cn 



ro 

4v 



o 

co 

CO 



cn 
o 



o 

CD 
CO 



o 

-4 
CO 



ro 

to 



CO 

cn 



CO 



Xv 
O 
CD 



o 



ro 

o 
to 



to 

CO 

ro 



p 

to 
ro 

CO 



o 

CO 



CO 



0l 

tr CO 

2. 3 
m 

3 



o 

73 



tt» 
to 
■o 
o 

3 

(0 

to 



c 

3 

< 
0> 

S3' 
o 

01 
3 
W_ 

»< 

to 

CO* 



to 
cn 



X 

T3 



tJ 

-n 
co 



to 
cn 

o 



25 



WO 2005/054510 



PCT/IB2004/004405 




cn 
cn 
o 



~4 
cn 
ro 



4^ 
co 
-4 
co 



to 

3 



CO 

-J 

CD 
O 



00 
O) 



CO 

3 



*-4 



4^ 

o 
-J 
4* 



CO 

ro 



co 



CO 

-4 



cn 
oo 
cn 

CD 



2 

to 

C*> 



CD 



00 
CO 
CO 

cn 



to 

3 



cn 



CO 

ro 

CO 

cn 



cn 
ro 



o 

CD 
CO 



CO 
CO 



CO 
00 

4* 



CO 

o 
co 



cn 
*o. 



to 
-u 
oo 
4* 



■p. 
cn 
to 
o 



oo 

CD 



cn 



CD 



ro 



CO 

o 

09 



ro 
co 

oo 



2 

to 



CD 
O 

ro 
co 



oo 

CD 

cn 
ro 



o> 

t» 



CO 

o 

CO 
CD 



O0 
CO 

ro 
o 



ro 
o 
cn 
to 



ro 
co 

CD 



CD 
CO 

o 



£ 

o 



CD 
CO 

cn 

-•4 



ro 



CD 



cn 
ro 
co 



cn 
oo 

CD 

co 



cn 
ro 
cn 
co 



ro 

oo 

CD 

ro 

co 



ro 

cn 

O0 



2 



CO 



CD 



OO 
OO 

ro 



*"4 
CO 



cn 



-•4 
00 

cn 



o 
o 
co 



co 



o 
o 

2 

ro 

CD 



o 
a 
co 
o 
co 



o 
ro 

-J 
ro 

•fx 



o 
o 
cn 
co 



CO 
CO 

o 

CO 



o 

4*. 

ro 

o 

CD 



o 
o 

cn 



ro 



ro 
3 



I 

ro 

CD 
CO 

cn 
ro 



I 

cn 
cn 
to 
co 



cn 

2 



CD 

2 
3 



CD 

cn 

CD 

ro 

00 



£ 

o 
ro 
co 
o 
4* 
ro 



1 

2 
2 

CO 



CD 

to 
o 

cn 



£ 

Co 
->J 

ro 
ro 

CO 



X 
to 



CD 

CD 
O 



to 
CO 

cn 

CO 

cn 



X 

co 

to 

CD 
4v 
O 

ro 



X 
cn 

o 
o 
ro 



X 

trt 

CO 
CO 

to 

CO 

o 

CO 



X 

trt 

CO 

CD 
O 

cn 



to 

fo 
o 

— I 

to 



X 

to 

CO 
CO 

CO 

ro 

CO 



X 

trt 

o 

CO 

to 



CO 

o 

CD 
CD 
CO 



"O 

CO 



4v 
cn 



! 

e 

ro 

CD 



£ 

cn 
-4 
to 



to 

-4. 
~>1 
CO 



CO 
CD 
-U 

ro 

to 

CO 



I 

cn 
cn 
cn 

CD 



73 
to 

S 

O) 



I 

oo 

ro 
-4 
cn 



I 

o 

CO 



£ 

CD 
CO 

co 
o 



CD 

-4 

CO 



£ 



£ 



X 

to 

■u 
cn 
cn 
co 
co 



X 

in 



X 

trt 

2 

cn 



X 

to 

CO 

o 

-x 

CD 
O 

4*. 



X 

trt 

ro 



-4 



to 

I 



X 

cn 

o 

CD 



X 

to 

CD 
CD 

ro 



X 

trt 

* 

-4 

cn 

2 

cn 



X 
to 

ro 



x 

CO 

it*, 
o 

CO 



X 

Crt 
CO 

to 
o 

CO 



X 

trt 

CO 
CO 
CO 

to 



X 

to 

2 

— v 

CO 

o 



73 

CO 

ro 
4X 

CD 



X 

trt 

ro 
cn 
o 
cn 
co 

CD 



X 

to 

Co 

CO 

cn 

CD 

co 



X 

co 

_i. 

CO 
Xi. 

CD 
CD 



X 
to 

ro 



x 

to 



CO 
CO 



X 

to 

4^ 
O 

cn 

CO 

co 

CO 



X 

to 

CO 

2 

CD 
CD 
O 



I 

o 
ro 
o 

CO 



CO 

> 
X 



73 
<5 



4*. 



o 
ro 



o 
O 

73 



T) 

Co 



X 



m 

M 
X 
r 3 



D 
O 

CD 



O 

I 

o 



c: 
-n 
O 



1 

cn 

CD 

CO 



O 

o 



CO 



CO 

> 

X 
ro 



TJ 
> 

O 

CD 
CD 



Q 
ro 



O 
O 
73 



73 

CO 

m 



X 

I 

o 



-n 

CO 



5 

X 
ro 



73 




Tt 

73 
TJ 
CO 

ro 



CD 

cn 
m 
tj 



co 
m 

73 
TI 

2: 
cn 



O 

CD 
73 



£ 

o 
to 
to 
to 



O 
X 



TJ 
F3 

TJ 



CO 
CO 



CO 
TJ 

CO 
CO 



2 

ro 



-4 
43 

to 

CO 



ro 



ro 



ro 



ro 
o 

co 
ro 

4* 

CO 
CO 



-4 

-o 



ro 



o 

43 



Fo 



ro 



t 



-4j 
-Q 
CO 

cnl 



ro 
o 

Co 



ro 



CO 
X) 

ro 
ro 

CO 



ro 
to 



ro 



-4 
43 

ro 



-4 
43 

to 

CO 



TJ 
CO 

ro 

co 



ro 

43 
CO 

ro 



to 

JO 
CO 



ro 

CD 



CO 
JQ 

ro 
cn 



ro 
o 
o 

ro 



to 

43 

Fo 

CO 



-4 
43 
CO 

ro 



to 

43 

CO 
CO 



4a. 
43 



ro 



43 

ro 

CO 



CD 

tj 
ro 

co 



CD 
43 

ro 
ro 

ro 



to 

43 
CO 
CO 



O 
43 

ro 



CD 



ro 

< 

ro 



T3 



ro 



ro 

43 

ro 
ro 



-o 

43 

4^ 

j. 
43 

to 
cn 



-4 
TJ 



ro 

* 

■o 
ro 



to 

CO 



•^4 
TJ 

CO 

ro 



CD 
TJ 

ro 
cn 



j3 

ro 



ro 
ro 

43 
CO 



CO 

ro 
ro 

CO 



ro 

co 

co 



co 

J3 

CO 

4- 
CO 



cn 

43 



ro 



ro 

-4 



to 



CO 
-4 



CO 

ro 



ro 



cn 



ro 




g 

ro 

£ 



+ 
« 

-4 

O 



O 

o 
ro 



o 

CO 



to 
ro 



p 

bo 

4*. 
-4 



o 
ro 



2 



to 

CO 



to 



2 



ro 

2 



CO 
CD 



ro 

2 



£ 

ro 

2 



£ 

to 

CD 



CD 



O 

CO 
4x 
CD 



O 

CO 

Co 



o 



CO 

o 

-4 



o 
o 



CD 



CD 

to 



o 

CO 

to 

-4 



O 

CO 
CO 

ro 



co 
co 



CO 



■ 

oo 

C3 



4^ 



O 

to 

CD 
4^ 



O 
g 

cn 



o 

CD 
-4 



is 



CO 
CO 



o 
cn 



o 
to 

CD 



co 



£ 

ro 

£ 



■ 

o 
o 



CD 

In 



o 

o 
o 

-4 



< 

ro 

2 



ro 

£ 



o 

I 

CD 

o 



< 

EL 
ro 

CD 



c 
a 



< 



cn 



ro 
ro 

CD 



to 
cn 
o 



o 

4*. 
CO 



o 
bo 
ro 



o 



ro 

CO 

cn 



o 

CD 

ro 



o 
o 



p 
bo 

CD 



CD 
CO 



O 
O 
CD 



O 



(O 



CO 
CO 



ro 

co 

CD 



co 
ro 



ro 

co 
oo 



to 
ro 



co 
co 



o 

CO 

o 



o 

2 



o 

CO 

ro 



o 
o 



7s 

£ 




cn 



ro 

CD 
4> 



ro 
ro 



cn 

CD 
CO 



o 

* 

o 
o 



to 



ro 
o 

CD 



-4 
CO 

ro 



o 
o 
o 
o 



CD 

to 



CO 
O 



o 

<7> 
73 
— I 



< 

Si. 
to 
cn 



T» 

CO 

m 



"71 



o 



cn 



to 

CO 



O 
O 
ro 
co 
co 

CO 



> 



< 



2 



Tl 
73 
Tl 
CO 
> 

rS 



to 



CD 

ro 



< 

CO 
CD 



co 
m 

73 
03 




O 

ca 

73 



2 



ro 
ro 



< 
ro 

2 



ro 

4^ 



fO 

00 



26 



WO 2005/054510 



PCT/IB2004/004405 



Table 3 

Suggested Threshold Values for Predictive Outcome 



Overall Response 
Outcome 


Threshold 
Value 


Significance 
PadJ 


DEME-6 


9M6 


. 0;0096 


SIAH2 


1.16 


0;0283 


CASP2 


0;94 


0,0085 


THRAP2 


1.16 


0.226 


FN1 


140.87 


0.0701 




Progression Free 
Survival Outcome 


Threshold 
Value 


Significance 
Pad] 


DEME-6 


9.38 


0,0115 


SIAH2 


0.76 


0.0206 


THRAP2 


5.02 


0.385 


TNC 


2.08 


0.254 



Table 4 

Regression Analysis of Individual Marker Genes 



OUTPOINTS RESPONSE 


















Univariate Regression 


N 


OR 


P 


95%CI 




HR 


P 


95%CI 


DEME-6 


. 240 


2.97 


<0.001 


1.65 


5.38 




0.60 


<0.001 


0.45 


0.79 


SIAH2 


■» . • • . - * • • -. ■ 
.242 


2.47 


6.002 


1.41 


4.34 




0,65 


0.003 


0.48 


0.86 


CASP2 


235 


0.35. 


<0;001 


0.20 


0.61 




1.33 


0.037 


1.02 


1.75 
























Multivariate Regression 


N 


OR 


P 


95%CI 




HR 


P 


9 


5%CI 


DEME-6 






0;0012. 


1.51 


5.34 




0.58 


0.0002 


0.43 


0.77 


SIAH2 


.... ..... 

; - .' \242:. : ■ 


i::-:^;4o- 


0.0079 


1.26 


4.59 




0.71 


0.028 


0.52 


0.96 


CASP2 


235 ; 


•.. -. .».«•,* • • * 

'0.33 


.0,00044 


0.18 


0.61 




1.39 


0.022 


1.05 


1.85 



N = number of tumor samples analyzed 

OR = Objective Response (OR >1 correlates with positive resonse to anti-estrogen therapy) 
HR = Hazard Ratio (HR < 1 correlates with positive response to anti-estrogen therapy) 
P = Significance value; p<0.05 is desired 
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Table 5 

Multivariate Regression Analysis of Marker Gene Combinations 



Marker Genes 


N 


OR 


P 


95%CI 




HR 


P 


95%CI 


DEME-6 CASP2 


231 




















DEME-6 




3.08 


0.00088 


1.69 


5.99 




0.58 


0.0003 
1 


0.44 


0.78 


CASP2 




0.31 


0.00029 


0.16 


0.58 




1.42 


0.015 


1.07 


1.89 
























DEME-6 SIAH2 


236 




















DEME-6 




2.44 


0,0069 


1.28 


4.66 




0.61 


0.0008 

8 1 


0.46 


0.82 


SIAH2 




1.89 


0.064 


0.96 


3.72 




0.78 


0.12 


0.57 


1.07 
























CASP2 SIAH2 


232 




















SIAH2 




2.45 


0,0091 


1.25 


4.79 




0.74 


0.058 


0.54 


1.01 


CASP2 




0^32 


0.00045 


0.17 


0.61 




1.35 


0.036 


1.02 


1.80 



0 

Table 6 

Multivariable Regression Analysis of Marker Gene Ratios 



Marker Genes 


N 


OR 


P 


95%CI 




HR 


P 


95%CI 


DEME-67GASP2 


231 


: 156 


,0.00053 


1.21 


2 




0.85 


0.0023 


0.76 


0.94 


DEME-6/SIAH2 


236 


0.96 


0.74 


0.75 


1.23 




1.05 


0.46 


0.93 


1.18 


CASP2/SIAH2 . 


232 


0.66 


0.0005 


0.53 . 


0.84 




1.19 


0.0007 
7 


1.08 


1.32 
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We claim: 

1 . A set of marker genes comprising two or more genes identified in Table 1 as 
differentially expressed in primary tumors of recurring breast cancer patients 
exhibiting a outcome to anti-estrogen therapy, with a significance of p<0.05. 

2. The set of marker genes of claim 1, comprising two or more genes of the 81 -gene 
signature listed in Table 1 , 

3. The set of marker genes of claim 1, comprising two or more genes of the 44-gene 
signature listed in Table 1 . 

4. The set of marker genes of claim 1, comprising one or more genes selected from 
FN-1, CASP-2, THRAP-2, SIAH-2, DEME-6, TNC, and COX-6C. 

5. The set of marker genes of claim 1, comprising one or more of TNC, SIAH-2, 
DEME-6, and COX-6C. 

6. The set of marker genes of claim 1, comprising one or more of FN-1, CASP-2, 
THRAP-2," SIAH-2, and DEME-6. 

7. The set of marker genes of claim 1, comprising one or more of DEME-6 and 
CASP2, and one or more of SIAH-2 and TNC. 

8. The set of marker genes of claim 1, comprising the 44-gene signature listed in 
Table 1 . 

9. A nucleic acid probe comprising a marker gene as defined in any of claims 1-7, or 
a complementary polynucleotide thereof, or a fragment thereof comprising at least 
10-50 contiguous nucleic acids. 
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10. A nucleic acid probe comprising a complementary polynuclotide of the nucleic 
acid probe of claim 9. 

11. An assay system for diagnosing patient response to anti-estrogen therapy for 
recurring breast cancer, comprising a set of marker genes or nucleic acid probe as 
defined in any of claims 1-10. 

12. The assay system of claim 1 1, wherein said marker genes are disposed on an 
assay surface. 

13. The assay system of claim 11, wherein said nucleic acid probe is disposed on an 
assay surface. 

14. The assay system of claim 1 1, wherein the assay surface comprises an assay chip, 
array, or fluidity card. 

15. An assay system for diagnosing patient response to anti-estrogen therapy for 
recurring breast cancer, comprising binding ligands that specifically detect 
polypeptide encoded by each of the respective marker genes of any of claims 1-7. 

16. The assay system of claim 15, wherein the binding ligands comprise an antibody 
or binding fragment thereof. 

17. A method for predicting outcome of anti-estrogen therapy for recurring breast 
cancer, the method comprising: 

a. analyzing a patient's primary tumor tissue for expression of a set of 
marker genes as defined in any of claims 1-7; and 

b. correlating a Cluster 1 expression pattern of the marker genes in the 
primary tumor with a prediction of Progressive Disease; and 
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c. correlating a Cluster 2 expression pattern of the marker genes in the 
primary tumor with a prediction of Objective Response to anti-estrogen 
therapy. 

18. A method for predicting Progression Free Survival of anti-estrogen therapy for 
recurring breast cancer, the method comprising: 

a. analyzing a patient's primary tumor tissue for expression of a set of 
marker genes as defined in any of claims 1-7; and 

b. correlating a Cluster 1 expression pattern of the marker genes in the 
primary tumor with a prediction of lack of progression free survival; and 

c. correlating a Cluster 2 expression pattern of the marker genes in the 
primary tumor with a prediction of positive progressin free survial. 
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Figure 1 
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Figure 2A 
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Figure 2B 
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Figure 3 
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Figure 4 
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